
10 research outputs found

Energy-based Self-attentive Learning of Abstractive Communities for Spoken Language Understanding

Author: Lorré Jean-Pierre
Shang Guokan
Tixier Antoine Jean-Pierre
Vazirgiannis Michalis
Publication date: 07/11/2019

Abstractive community detection is an important spoken language understanding task, whose goal is to group utterances in a conversation according to whether they can be jointly summarized by a common abstractive sentence. This paper provides a novel approach to this task. We first introduce a neural contextual utterance encoder featuring three types of self-attention mechanisms. We then train it using the siamese and triplet energy-based meta-architectures. Experiments on the AMI corpus show that our system outperforms multiple state-of-the-art energy-based and non-energy-based baselines. Code and data are publicly available.

arXiv.org e-Print Archive

HAL-Polytechnique

Speaker-change Aware CRF for Dialogue Act Classification

Author: Lorré Jean-Pierre
Shang Guokan
Tixier Antoine Jean-Pierre
Vazirgiannis Michalis
Publication date: 01/01/2020

Recent work in Dialogue Act (DA) classification approaches the task as a sequence labeling problem, using neural network models coupled with a Conditional Random Field (CRF) as the last layer. The CRF models the conditional probability of the target DA label sequence given the input utterance sequence. However, the task involves another important input sequence, that of speakers, which is ignored by previous work. To address this limitation, this paper proposes a simple modification of the CRF layer that takes speaker change into account. Experiments on the SwDA corpus show that our modified CRF layer outperforms the original one, with very wide margins for some DA labels. Further, visualizations demonstrate that our CRF layer can learn meaningful, sophisticated transition patterns between DA label pairs conditioned on speaker change in an end-to-end way. Code is publicly available.

arXiv.org e-Print Archive

Crossref

HAL-Polytechnique

Compréhension du Langage Parlé pour le Résumé Abstractif de Réunion (Spoken Language Understanding for Abstractive Meeting Summarization)

Author: Shang Guokan
Publication venue: HAL CCSD
Publication date: 28/01/2021

With the impressive progress that has been made in transcribing spoken language, it is becoming increasingly possible to exploit transcribed data for tasks that require comprehension of what is said in a conversation. The work in this dissertation, carried out in the context of a project devoted to the development of a meeting assistant, contributes to ongoing efforts to teach machines to understand multi-party meeting speech. We have focused on the challenge of automatically generating abstractive meeting summaries.
We first present our results on Abstractive Meeting Summarization (AMS), which aims to take a meeting transcription as input and produce an abstractive summary as output. We introduce a fully unsupervised framework for this task based on multi-sentence compression and budgeted submodular maximization. We also leverage recent advances in word embeddings and graph degeneracy applied to NLP, to take external semantic knowledge into account and to design custom diversity and informativeness measures.
Next, we discuss our work on Dialogue Act Classification (DAC), whose goal is to assign each utterance in a discourse a label that represents its communicative intention. DAC yields annotations that are useful for a wide variety of tasks, including AMS. We propose a modified neural Conditional Random Field (CRF) layer that takes into account not only the sequence of utterances in a discourse, but also speaker information and, in particular, whether there has been a change of speaker from one utterance to the next.
The third part of the dissertation focuses on Abstractive Community Detection (ACD), a sub-task of AMS, in which utterances in a conversation are grouped according to whether they can be jointly summarized by a common abstractive sentence. We provide a novel approach to ACD in which we first introduce a neural contextual utterance encoder featuring three types of self-attention mechanisms and then train it using the siamese and triplet energy-based meta-architectures. We further propose a general sampling scheme that enables the triplet architecture to capture subtle patterns (e.g., overlapping and nested clusters).

Thèses en Ligne

Theses.fr

HAL-Polytechnique

Abstractive Meeting Summarization: A Survey

Author: Hunter Julie
Rennard Virgile
Shang Guokan
Vazirgiannis Michalis
Publication venue: The MIT Press
Publication date: 25/07/2023

A system that could reliably identify and sum up the most important points of a conversation would be valuable in a wide variety of real-world contexts, from business meetings to medical consultations to customer service calls. Recent advances in deep learning, and especially the invention of encoder-decoder architectures, have significantly improved language generation systems, opening the door to improved forms of abstractive summarization, a form of summarization particularly well-suited for multi-party conversation. In this paper, we provide an overview of the challenges raised by the task of abstractive meeting summarization and of the data sets, models, and evaluation metrics that have been used to tackle the problems.

HAL-Polytechnique

DATScore: Evaluating Translation with Data Augmented Translations

Author: Kamal Eddine Moussa
Shang Guokan
Vazirgiannis Michalis
Publication venue: Association for Computational Linguistics
Publication date: 01/05/2022

HAL-Polytechnique

FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metrics for Automatic Text Generation

Author: Kamal Eddine Moussa
Shang Guokan
Tixier Antoine
Vazirgiannis Michalis
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 16/10/2021

arXiv.org e-Print Archive

HAL-Polytechnique

FREDSum: A Dialogue Summarization Corpus for French Political Debates

Author: Grari Damien
Hunter Julie
Rennard Virgile
Shang Guokan
Vazirgiannis Michalis
Publication venue: Association for Computational Linguistics
Publication date: 01/12/2023

HAL-Polytechnique

Automatic Analysis of Substantiation in Scientific Peer Reviews

Author: Clavel Chloé
Guo Yanzhu
Rennard Virgile
Shang Guokan
Vazirgiannis Michalis
Publication venue: Association for Computational Linguistics
Publication date: 01/12/2023

HAL-Polytechnique

Unsupervised Abstractive Meeting Summarization with Multi-Sentence Compression and Budgeted Submodular Maximization

Author: Ding Wensi
Lorré Jean-Pierre
Meladianos Polykarpos
Shang Guokan
Tixier Antoine
Vazirgiannis Michalis
Zhang Zekun
Publication venue: Association for Computational Linguistics
Publication date: 01/01/2018

arXiv.org e-Print Archive

Crossref

HAL-Polytechnique

CommunityClick

Author: Adams Brian
Bajko Robert
Bennett William E
Blei David M
Bougouin Adrien
Boulianne Shelley
Bovaird Tony
Brabham Daren C
Bryan Frank M
Bryson John M
Burnard Philip
Carletta Jean
Chiu Patrick
Coleman Stephen
Convertino Gregorio
Corbett Eric
Costa Dora L
D'Ignazio Catherine
D'mello Sidney
Davies Rodrigo
Davis Richard C
Dickinson Jessa
Ehsaei Amir
Fung Archon
Garg Nikhil
Gastil John
Given Lisa M
Goodman Leo A
Herreid Clyde Freeman
Hirschberg Julia
Innes Judith E
Irvin Renee A
Kalnikaitė Vaiva
Karpowitz Christopher F
Keller Christopher
Kenneth
Levine Peter
Likert Rensis
Lin Chin-Yew
Mahyar Narges
Mehrabian Albert
Mihalcea Rada
Mollborn Stefanie
Morse Judith
Murray Gabriel
Nickerson Rand B
Papineni Kishore
Piolat Annie
Putnam Robert D
Rabiner Lawrence R
Rhody Lisa Marie
Robertson Stephen E
Salgado Mariana
Sanders Lynn M
Shang Guokan
Tas Oguzhan
Torres Lars Hasselblad
Tur Gokhan
Vaportzis Eleftheria
Wen Tsung-Hsien
Whitehead Christopher
Zhang Amy X
Zhong Ming
Publication venue: Association for Computing Machinery (ACM)

Crossref